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Hi stone Deacetylase 9 
Field of the Invention 

The present invention relates to the identification and 
5 isolation of a histone deacetylase and uses of this 

enzyme. In particular, the present invention relates to 
histone deacetylase 9 (HDAC9) , a member of the class II 
histone deacetylase family. 

10 Background of the Invention 

Eukaryotic DNA is packaged into chromatin in a precise 
and highly regulated manner. This level of organisation 
is fundamental to many processes in the cell including 
replication, repair recombination, chromosomal 

15 segregation and transcriptional regulation. DNA is 

wrapped around the core histone octamer (H2A, H2B, H3 and 
H4) to form nucleosomes, which are the basic repeating • 
units of chromatin. The crystal structure of the 
nucleosome has been solved and this has provided much 

20 information but there is still a great deal to learn 

about the mechanisms by which distinct functional domains 
of chromatin are formed and maintained. The most 
important changes in chromatin structure are thought to 
be influenced by post-translational modifications of N- 

25 terminal tails of the histones, which protrude from the 
nucleosome. These are highly basic when unmodified and 
interact with negatively charged DNA phosphate backbone. 
Specific amino acids within these tails are targets for a 
variety enzymes which can produce diverse modifications 

30 such as acetylation, methylation, phosphorylation, 

. poly (ADP-ribosylation) , and ubiquination. Acetylation is 
thus far the most widely studied and is catalysed by 
histone acetyltransf erases (HATs) and involves 
substitution of the e-amino group of specific lysines. 
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This leads to a more acidic residue and an overall 
decreased affinity for DNA by the histone octamer. It 
appears that the histone tails also mediate interactions 
between nucleosomes to form higher order chromatin 
5 structures and they could be disrupted by acetylation. 
The packaging of DNA into nucleosomal arrays presents a 
major physical obstacle to the transcriptional machinery 
when trying to gain access to the DNA template and there 
is strong evidence that unwinding of nucleosomes due to 

10 the acetylation of histone tails plays a fundamental role 
in the activation of gene transcription. In contrast to 
histone acetylation in transcriptional activation, 
• enzymes which remove these modifications would be 
expected to have an important role in down-regulation and 

15 gene silencing. This has indeed been shown to be the 
case and recent studies have also implicated abnormal 
histone deacetylase function in a number of human 
cancers. 

20 To date eight histone deacetylases have been 

characterised which may broadly divided into two related 
classes which share homology through their deacetylase 
domains with yeast histone deacetylases Rpd3 and Hdal. 
Class II HDACs 4, 5 and 7 may be differentiated from the 

25 Class I HDACs as they contain an additional N-terminal, 
non catalytic, region which is homologous to a protein 
previously characterized as a co-repressor, MEF2- 
Interacting Transcription Repressor (MITR/HDRP) . 

30 Zhou et al (PNAS, 98 (19) : 10572-10577, 2001) purports to 
disclose the sequence of histone deacetylase HDAC9 . 
However, the protein disclosed in the paper is incomplete 
and the level of deacetylase activity reported in the 
.paper is below the negative control. 
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Summary of the Invention 

Broadly, the present invention relates to the 
identification and cloning of histone deacetylase 9 
(HDAC9) , and in particular full length HDAC9 polypeptides 
5 . and HDAC9 polypeptides which have deacetylase activity, 
and to nucleic acid molecules encoding these 
polypeptides. The present invention further relates to 
uses of these polypeptides and nucleic acid molecules, in 
particular for screening for compounds that are capable 
10 of modulating a HDAC9 biological activity. 

Accordingly, in a first aspect, the present invention 
provides an isolated HDAC9 polypeptide having the amino 
acid sequence set out in Figure 2. This figure shows the 

15 cDNA and predicted amino acid sequence of wild-type 
histone deacetylase 9 (HDAC9) , a 1069 amino acid 
polypeptide. The HDAC9 sequence is overlaid with the 
sequence of MITR that shares a common N-terminal amino 
acid sequence with HDAC9 polypeptide up to amino acid 577 

20 at the end of exon 11, but has a different exon 12. The 
MITR sequence is a 593 amino acids long. 

The HDAC9 sequence reported in Zhou et al (2001) is 
identical to the sequence shown in Figure 2 up to amino 

25 acid Serl008, but does not include the C-terminal 61 

amino acids which are encoded by exons 24 and 25 as shown 
in Figure 2. Significantly, the polypeptide reported in 
Zhou et al substantially lacks histone deacetylase 
activity. In contrast, preferred polypeptides of the 

30 present invention possess a HDAC9 biological activity, 

and especially histone deacetylase activity, and include 
all or part of the amino acid sequence set out between 
amino acids 1009 and 1069 of Figure 2. Compared to other 
histone deacetylases, HDAC9 has 55% sequence identity to 



3 



WO 03/029451 



PCT/GB02/04455 



HDAC4 , 56% sequence identity to HDAC5 and 38% sequence 
identity to HDAC7 . 

In a further aspect, the present invention" provides an 
5 isolated polypeptide having at least 80% amino acid 

sequence identity with the amino acid sequence set out in 
Figure 2. Preferred embodiments of the invention provide 
. isolated polypeptides having at least 95% sequence 
identity with the sequence set out in Figure 2 . 

10 

In a further aspect, the present invention provides an 
isolated polypeptide which is encoded by a nucleic acid 
sequence which is capable of hybridising under stringent 
conditions to the nucleic acid sequence set out in Figure 
15 2, or a complementary sequence thereof, and in particular 
to the nucleic acid sequence comprising a sequence which 
is capable of hybridising to a nucleic acid encoding 
amino acids 1009 to 1069 of the HDAC9 polypeptide as set 
out in Figure 2, or a complementary sequence thereof. 

20 

In a further aspect, the present invention provides a 
substance which is a fragment, active portion or sequence 
variant of one of the above polypeptides. Preferred 
fragments and active portions comprise all or part of the 

25 HDAC9 sequence set out between amino acids 1009 and 1069 
inclusive as shown in Figure 2 . Sequence variants are 
defined below and include HDAC9 polypeptides in which one 
or more of the exons shown in Figure 2 are deleted, such 
as the variants in which exon 7, exon 12 and/or exon 15 

30 are deleted. 

In a further aspect, the present invention provides a 
HDAC9 polypeptide as defined above joined to a coupling 
partner. 
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In a further aspect, the present invention provides an 
isolated nucleic acid molecule encoding one of the above 
polypeptides, and complementary nucleic acid sequences 
5 thereof. The cDNA sequence of full length HDAC9 is shown 
in Figure 2. The present invention also includes nucleic 
acid molecules having greater than a 90% sequence 
identity with one of the above nucleic acid sequence. In 
other embodiments, the present invention relates to 

10 nucleic acid sequences which hybridise to the coding 

.sequence set out in Figure 2, or a complementary sequence 
thereof, and more especially to nucleic acid .sequences 
which hybridise to a nucleic acid sequence encoding amino 
acids 1009 to 1069 as set out in Figure 2, or a 

15 complementary sequence thereof. 

In further aspects, the present invention provides an 
expression vector comprising one of the above nucleic 
acid operably linked to control sequences to direct its 
20 expression, and host cells transformed with the vectors. 
The present invention also includes a method of producing 
a histone deacetylase 9 polypeptide, or a fragment or 
active portion thereof, comprising culturing the host 
cells and isolating the polypeptide thus produced. 

25 

In a further aspect, the present invention provides a 
composition comprising one or more of the above 
polypeptides or nucleic acid molecules as defined herein. 

30 In a further aspect, the present invention provides the 
use of a histone deacetylase 9 polypeptide as defined 
herein (including fragments, active portions or sequence 
variants), or a corresponding nucleic acid molecule, for 
screening for candidate compounds which (a) share a 
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histone deacetylase 9 biological activity or (b) bind to 
the histone deacetylase 9 polypeptide or (c) inhibit a 
biological activity of a histone deacetylase 9 
polypeptide. By way of example, screening can be carried 
5 out to find peptidyl or non-peptidyl mimetics or 

inhibitors of the HDAC9 polypeptides to develop as lead 
compounds in pharmaceutical research. 

Thus, in one embodiment, the present invention provides a 
10 method of identifying a compound which is capable of 
modulating an activity of a histone deacetylase 9 
polypeptide, the method comprising: 

(a) contacting at least one candidate compound with 
.a histone deacetylase 9 (HDAC9) polypeptide as defined 

15 herein under conditions in which the candidate' compound 
and HDAC9 polypeptide are capable of interacting; 

(b) determining in an assay for a HDAC9 activity 
whether the candidate compound modulates the activity; 
and, 

20 (c) selecting a candidate compound which modulates 

an activity of the HDAC9 polypeptide. 

In a further aspect, the present invention provides a 
method of identifying a compound which is capable of 
25 inhibiting histone deacetylase 9 (HDAC9) polypeptide, the 
method comprising: 

(a) contacting at least one candidate compound and a 
HDAC9 polypeptide as defined herein in the presence of a 
substrate for HDAC9 under conditions in which the 

30 candidate compound, HDAC9 polypeptide and HDAC9 substrate 
are capable of interacting; 

(b) determining whether the candidate compound 
inhibits the activity of the HDAC9 polypeptide in 
reacting with the substrate; and, 
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(c) selecting the candidate compound which inhibits 
the activity of the HDAC9 polypeptide on the substrate. 

In one embodiment, the substrate is a histone which is a 
5 substrate for HDAC9 . In this assay, the effect of a 
candidate compound in modulating the activity of HDAC9 
can be assessed by determining the amounts of acetylated 
and deacetylated histone present after the reaction of 
HDAC9 in the presence or absence of the candidate 
10 compounds. This can readily be determined using 

techniques well known in the art, for example employing 
antibodies which are capable of specifically binding to 
acetylated or deacetylated substrate. 

15 In this aspect of the invention, preferably the activity 
of the HDAC9 polypeptide is the activity of removing an 
.acetyl groups from the substrate. Conveniently, the 
progress of this reaction can be assessed by labelling 
the substrate with a detectable label (e.g. a radioactive 

20 label) and measuring the amount of label released from 
the substrate by the action of the HDAC9 polypeptide, 
e.g. in a scintillation proximity assay. Preferably, the 
method is for screening for modulators of HDAC9 which may 
be useful in for the treatment of cancer. 

25 

In a further aspect, the present invention provides a 
method of identifying a compound which is capable of 
inhibiting histone deacetylase 9 (HDAC9) polypeptide, the 
method comprising: 
30 (a) contacting at least one candidate compound and a 

HDAC9 polypeptide as defined herein in the presence of a 
substrate for HDAC9 under conditions in which the 
candidate compound, HDAC9 polypeptide and HDAC9 substrate 
are capable of interacting; 
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(b) determining whether the candidate compound 
inhibits the activity of the HDAC9 polypeptide in 
reacting with the substrate; and, 

(c) selecting the candidate compound which inhibits 
5 the activity of the HDAC9 polypeptide on the substrate. 



In one embodiment, candidate compounds are screened for 
activity in modulating HDAC9 in a cell based reporter 
assay using cells which produce HDAC9 fused to a nucleic 

10 acid binding domain such as GAL 4 , e.g. by transfecting 
cells with appropriate expression and reporter vectors. 
In the cells, HDAC9-GAL4 chimeric protein which is 
capable of interacting with a GAL 4 DNA binding site can 
bind to a promoter which includes such a site, the 

15 promoter being linked to a reporter construct. The HDAC9 
activity can thus deacetylate core histones associated 
■ with a DNA surrounding GAL 4 binding site resulting in 
down regulation of promoter activity. Compounds that 
inhibit HDAC9 activity would counteract this down 

20 regulation. Therefore, changes in reporter activity can 
be used as a screen for compounds that either stimulate 
or inhibit HDAC9 activity. 

In a further aspect, the present invention provides 
25 antibodies capable of specifically binding to the above 
HDAC9 polypeptides, or an active portion, domain or 
fragment thereof, and the use of the HDAC9 polypeptide or 
peptides based on the sequence for designing antibodies 
or for use in a method of preparing antibodies capable of 
30 binding to HDAC9. These antibodies can be used in assays 
to detect and quantify the presence of HDAC9 polypeptide, 
in methods of purifying HDAC9 polypeptides, and as 
inhibitors of HDAC9 polypeptides. 
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In a further aspect , the present invention provides a 
method of amplifying a nucleic acid test sample 
comprising priming a nucleic acid polymerase reaction 
with nucleic acid encoding a HDAC9 polypeptide as defined 
5 above - 

In a further aspect, the present invention provides a 
method for the diagnosis or prognosis of cancer, the 
method comprising determining the presence or amount of 
10 HDAC9 polypeptide, or an isoform thereof, or HDAC9 
nucleic acid in a sample from a patient. This is 
discussed in more detail below. 

These and other aspects of the present invention are 
15 described in more detail below. By way of example and 

not limitation, embodiments of the present invention will 
now be described in more detail with reference to the 
accompanying figures. 

20 Brief Description of the Figures 

Figure 1. Schematic representation of the rational behind 
the cloning of the 3 1 end HDAC9. EST accession nos. are 
.indicated on the left with position numbers on the right. 
HDAC9 sequence is shown in bold and highlighted in grey. 

25 Sequence identity is indicated by vertical lines. The 

EST database at the NCBI was screened with EST sequences 
corresponding to HDAC9 cDNA found through homology to 
HDAC5. A number of overlapping clones were obtained 
which enabled the cloning of HDAC9 through the design of 

30 specific primers. 

Figure 2. HDAC9 cDNA sequence together with the predicted 
HDAC9 amino acid sequence. HDAC9 full length cDNA has an 
open reading frame of 3207bp which yields a 1069 amino 
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acids protein. The HDAC9Aexon7 isoform results in an A 
to E substitution at position 222. The sequence specific 
to HDAC9ACD exon 12 is shown in italics. 

5 Figure 3 . Schematic representation of HDAC9 isof orms . The 

length of the various HDAC9 isoforms together with the 
primers used in cloning and RT-PCR analysis. 

Figure 4. Analysis of HDAC9 sequence. 
10 A. Amino acid sequence alignment of HDAC9 , HDAC4, HDAC5, 
HDAC7, and a bacterial deacetylase, HDLP. The indicated 
sequences were aligned using Clustal W. Identical 
residues are boxed and highlighted in dark grey; similar 
residues are shaded in light grey. 

15 

B. Evaluation of amino acid identities and similarities 
of HDAC9 compared with other his tone deacetylases . 

Values were obtained by comparing the whole protein or 
deacetylase domain of HDAC9 with the indicated protein 
20 sequences on BioEdit Sequence Alignment Editor using the 
Blosum62 matrix. 

C. Phylogenetic tree of HDAC1 through to HDAC9. Sequences 
were aligned using the Clustal W server at the Centre for 

25 Molecular and Biomolecular Informatics 

. (University of Nijmegen) . The PHYLIP (bracket) notation 
output was used to construct an unrooted tree. 

Figure 5. HDAC9 possesses HDAC activity and represses 
3 0 basal transcription . 

A. HDAC 9 deacetylates his tone H4 peptide. 293T cells 
were transfected with FLAG-tagged HDAC. Whole cell 
lysates were produced and the HDACs precipitated with 
anti-FLAG agarose. The beads were assayed for their 
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ability to deacetylate a [ 3 H] acetyl-labelled peptide 
corresponding to the N-Terminus of histone H4 . Free 
acetate was extracted and measured by scintillation 
counting. 

5 

B. GAL4-HDAC9 Inhibits promoter activity xn-vlvo in a TSA 
sensitive manner. GAL4-Tk-Luc was transiently 
'transfected into 293T cells together with lOOng GAL4DBD 
fusions. All Results were performed in duplicate and 
10 error. 

Figure 6. Northern blot showing the differential 
expression of HDAC9 polypeptide in human tissues. A 

multiple human tissue Northern Blot was analysed with a 
15 32 P labelled probe corresponding to the sequence found 
only in the full length HDAC9 cDNA in order to evaluate 
levels of expression of HDAC9 mRNA. The blot was 
reprobed with p-actin cDNA as a normalisation control. 
The tissues examined are indicated at the top of each 
20 lane. Positions of the RNA size markers are indicated in 
kilobases on the left of the blot. 

Figure 7. HDAC9 displays higher activity towards histone 
H3 peptide. 293T cells were transfected with FLAG-tagged 

25 HDAC. Whole cell lysates were produced and the HDACs 
precipitated with anti-FLAG agarose. The beads were 
assayed for their ability to deacetylate a [ 3 H] acetyl- 
labelled peptide corresponding to the N-Terminus of 
■Histone H3. Free acetate was extracted and measured by 

30 scintillation counting. 

Figure 8. Expression of various HDAC9 isoforms in normal 
and cancer cells. Total RNA was isolated from the 
indicated cell lines and tissues and cDNA generated with 
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M-MLV reverse transcriptase using either gene specific or 
random primers. Standard PCR was performed for 32 
cycles . 

5 Figure 9 . In vitro interactions of HDAC9 with various 

oncoproteins and co-repressors GST-HDAC9, GST-HDAC9CD and 
GST-MITR were produced from E. coll DH5a. [ 35 S]methionine- 
labeled proteins were synthesized in-vitro using a rabbit 
reticulocyte lysate-coupled transcription-translation 
10 system. Assays were performed in NETN Buffer, washed in 
H Buffer, separated by SDS-PAGE and visualised by 
autoradiography . 

Figure 10. Cellular localization of various isoforms of 
15 HDAC9. COS-1 cells were transiently transfected with N- 
Terminus flag tagged HDAC9, HDAC9ACD/MITR, HDAC9AExon7, 
HDAC9 AExonl2 or HDAC9AExonl5 as indicated. After 
methanol fixation, cells were stained with dapi (blue) 
and antibodies raised against HDAC9 or flag (green) . G- 
20 15 is an antibody to the N-Terminus of HDAC9 which h 
detects all isoforms (including HDAC9ACD/MITR) . 

Figure 11 . Effect of various isoforms of HDAC9 on the 
cellular localization of BCL-6. COS-1 cells were 
25 transiently transfected with N-Terminus flag tagged 

HDAC9, HDAC9ACD, or HDAC9Aexon7, as indicated. After 
methanol fixation, cells were stained with dapi (blue) 
and antibodies raised against HDAC9 G-15 (green) or BCL- 
6 (red) . 

30 

Figure 12 . Effect of various isoforms of HDAC9 on the 
cellular localization of NCoR. COS-1 cells were 
transiently transfected with N-Terminus flag tagged 
HDAC9, HDAC9ACD, or HDAC9Aexon7, as indicated After 
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methanol fixation, cells were stained with dapi (blue) 
and antibodies raised against FLAG (green) G-15 or NCoR 
. (red) . 

5 Figure 13. Effect of various isoforms of HDAC9 on the 
cellular localization of PLZF. COS-1 cells were 
transiently transfected with N-Terminus flag tagged 
HDAC9, or HDAC9 Aexon7, as indicated After methanol 
fixation, cells were stained with dapi (blue) and 
10 antibodies raised against HDAC9 G-15 (green) or PLZF 
(red) . 

Histone Deacetylase 9 Nucleic Acid 

A "HDAC9 nucleic acid" includes a nucleic acid molecule 

15 which has a nucleotide sequence encoding a polypeptide 
comprising the amino acid sequence shown in Figure 2 or 
any one of the other HDAC9 polypeptides of the present 
invention. The HDAC9 coding sequence may be the full 
length nucleic acid sequence shown in Figure 2, a 

20 complementary nucleic acid sequence, or it may be a 
sequence variant differing from one of the above 
sequences by one or more of addition, insertion, deletion 
and substitution of one or more nucleotides of the 
sequence shown. Changes to a nucleotide sequence may 

25 result in an amino acid change at the protein level, or 
not, as determined by the genetic code. Nucleic acid 
encoding a polypeptide which is a sequence variant 
preferably has at least 80% sequence identity, more 
preferably at least 95% sequence identity, more 

30 preferably at least 98% sequence identity, and most 
preferably at least 99% sequence identity with the 
nucleic acid sequence shown in Figure 2. 

The present invention also includes fragments of the 

13 
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HDAC9 nucleic acid sequences described herein, the 
fragments preferably being at least 60, 120, 180, 240, 
480 or 960 nucleotides in length. Preferred HDAC9 
nucleic acid fragments share at least part of their 
5 nucleic acid sequence with the HDAC9 nucleic acid 

• sequence which encodes amino acids 1008 to 1069 of the 
polypeptide shown in Figure 2, or a complementary 
sequence thereof. 

10 Generally, nucleic acid according to the present 

invention is provided as an isolate, in isolated and/or 
purified form, or free or substantially free of material 
with which it is naturally associated. Nucleic acid may 
be wholly or partially synthetic and may include genomic 

15 DNA, cDNA or RNA. Where nucleic acid according to the 
invention includes RNA, reference to the sequence shown 
should be construed as reference to the RNA equivalent, 
with U substituted for T. 

20 The present invention also includes nucleic acid 

molecules which are capable of hybridising to one of the 
HDAC9 sequences disclosed herein, or a complementary 
sequence thereof. Stringency of hybridisation reactions 
is readily determinable by one of ordinary skill in the 

25 art, and generally is an empirical calculation dependent 
upon probe length, washing temperature and salt 
concentration. In general, longer probes require higher 
temperatures for proper annealing, while shorter probes 
need lower temperatures. Hybridisation generally depends 

30 on the ability of denatured DNA to reanneal when 

complementary strands are present in an environment below 
their melting temperature. The higher the degree of 
desired homology between the probe and hybridisable 
sequence, the higher the relative temperature which can 
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be used. As a result, it follows that higher relative 
temperatures would tend to make the reaction conditions 
more stringent, while lower temperatures less so. For 
additional details and explanation of stringency of 
5 hybridisation reactions, see Ausubel et al, Current 
Protocols in Molecular Biology, Wiley Interscience 
Publishers, (1995) . 

. Preferably, a nucleic acid sequence will hybridise to a 
10 HDAC9 sequence of the invention, or a complementary 

sequence thereof under "stringent conditions". These are 
well known to those skilled in the art and include those 
that: (1) employ low ionic strength and high temperature 
for washing, for example 0.015 M sodium chloride/0.0015 M 
15 sodium citrate/ 0.1% sodium dodecyl sulfate at 50°C; (2) 
employ during hybridisation a denaturing agent, such as 
formamide, for example, 50% (v/v) formamide with 0.1% 
bovine serum albumin/0.1% Ficoll/0.1% 

polyvinylpyrrolidone/50mM sodium phosphate buffer at pH 
20 6.5 with 760 mM sodium chloride, 75 mM sodium citrate at 
42°C; or (3) employ 50% formamide, 5 x SSC (0.75 M NaCl, 
0.075 M sodium citrate), 50 mM sodium phosphate (pH 6 8), 
0.1% sodium pyrophosphate, 5 x Denhardt's solution, 
sonicated salmon sperm DNA (50 p,g/ml) , 0.1% SDS, and 10% 
25 dextran sulfate at 42°C, with washes at 42 °C in 0.2 x SSC 
(sodium chloride/sodium citrate) and 50% formamide at 
55°C, followed by a high-stringency wash consisting of 
0.1 x SSC containing EDTA at 55 °C. 

30 Nucleic acid sequences encoding all or part of the HDAC9 
gene and/or its regulatory elements can be readily 
prepared by the skilled person using the information and 
references contained herein and techniques known in the 
art (for example, see Sambrook, Fritsch and Maniatis, 
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Molecular Cloning, A Laboratory Manual , Cold Spring 
Harbour Laboratory Press, 1989, and Ausubel et al, Short 
Protocols in Molecular Biology, John Wiley and Sons, 
1992) . These techniques include (i) the use of the 

5 polymerase chain reaction (PCR) to amplify samples of 
such nucleic acid, e.g. from genomic sources, (ii) 
chemical synthesis, or (iii) amplification in E. coli. 
Modifications to the HDAC9 sequences can be made, e.g. 
using site directed mutagenesis, to provide expression of 

0 modified HDAC9 polypeptide or to take account of codon 
preference in the host cells used to express the nucleic 
• acid. 

PCR techniques for the amplification of nucleic acid are 
5 described in US Patent No. 4,683,195. The HDAC9 nucleic 
acid sequences provided herein readily allow the skilled 
person to design PCR primers. References for the general 
use of PCR techniques include Mullis et al, Cold Spring 
Harbour Symp. Quant. Biol., 51:263, (1987), Ehrlich (ed) , 
0 PCR Technology, Stockton Press, NY, 1989, Ehrlich et al, 
Science, 252:1643-1650, (1991), "PCR protocols; A Guide 
to Methods and Applications", Eds. Innis et al, Academic 
Press, New York, (1990). 

5 In order to obtain expression of the HDAC9 nucleic acid 
sequences, the sequences can be incorporated in a vector - 
having control sequences operably linked to the HADC9 
nucleic acid to control its expression. The vectors may 
include other sequences such as promoters or enhancers to 

0 drive the expression of the inserted nucleic acid, 

nucleic acid sequences so that the HDAC9 polypeptide is 
produced as a fusion and/or nucleic acid encoding 
secretion signals so that the polypeptide produced in the 
host cell is secreted from the cell. Suitable vectors 
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can be chosen or constructed, containing appropriate 
regulatory sequences, including promoter sequences, 
terminator fragments, polyadenylation sequences, enhancer 
sequences, marker genes and other sequences as 
5 appropriate. Vectors may be plasmids or viral, e.g. 

'phage, or phagemid, as appropriate. For further details 
see, for example, Molecular Cloning: a Laboratory Manual: 
2nd edition, Sambrook et al., 1989, Cold Spring Harbour 
Laboratory Press. Many known techniques and protocols 

10 for manipulation of nucleic acid, for example in 

preparation of nucleic acid constructs, mutagenesis, 
sequencing, introduction of DNA into cells and gene 
expression, and analysis of proteins, are described in 
detail in Current Protocols in Molecular Biology, Ausubel 

15 • et al. eds., John Wiley & Sons, 1992. 

HDAC9 polypeptide can be obtained by transforming the 
vectors into host cells in which the vector is 
functional, culturing the host cells so that the HDAC9 

20 polypeptide is produced and recovering the HDAC9 

polypeptide from the host cells or the surrounding 
medium. Prokaryotic and eukaryotic cells are used for 
this purpose in the art, including strains of E. coli, 
insect cells (e.g. transformed with baculovirus) , yeast, 

25 and eukaryotic cells such as COS or CHO cells. The 

choice of host cell can be used to control the properties 
of the HDAC9 polypeptide expressed in those cells, e.g. 
controlling where the polypeptide is deposited in the 
host cells or affecting properties such as its 

30 glycosylation and phosphorylation. If the polypeptide is 
expressed coupled to an appropriate signal leader peptide 
it may be secreted from the cell into the culture medium. 
Following production by expression, a polypeptide may be 
isolated and/or purified from the host cell and/or 
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culture medium, as the case may be, and subsequently used 
as desired, e.g. in the formulation of a composition 
which may include one or more additional components such 
as a carrier. Polypeptides may also be expressed in in 
5 vitro systems, such as reticulocyte lysate. 

The nucleic acid sequences provided herein are useful for 
identifying HDAC9 nucleic acid in a test sample. The 
present invention provides a method of obtaining nucleic 

10 acid of interest, the method including hybridising a 
probe sharing all or part of the sequence provided 
herein, or a complementary sequence, to the target 
nucleic acid. Hybridisation is generally followed by 
identification of successful hybridisation and isolation 

15 of nucleic acid which has hybridised to the probe, which 
may involve one or more steps of PCR. These methods may 
be useful in determining whether HDAC9 nucleic acid is 
•present in a sample, e.g. in a particular type of cells 
present in the sample. 

20 

Nucleic acid according to the present invention is 
obtainable using one or more oligonucleotide probes or 
primers designed to hybridise with one or more fragments 
of the nucleic acid sequence shown herein, particularly 

25 fragments of relatively rare sequence, based on codon 
usage or statistical analysis. A primer designed to 
hybridise with a fragment of the nucleic acid sequence 
shown in the above figures may be used in conjunction 
with one or more oligonucleotides designed to hybridise 

30 to^ a sequence in a cloning vector within which target 
nucleic acid has been cloned, or in so-called "RACE" 
(rapid amplification of cDNA ends) in which cDNA's in a 
library are ligated to an oligonucleotide linker and PCR 
is performed using a primer which hybridises with the 
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sequence shown herein and a primer which hybridises to 
the oligonucleotide linker. 

On the basis of amino acid sequence information, 
5 oligonucleotide probes or primers may be designed, taking 
into account the degeneracy of the genetic code, and 
where appropriate, codon usage of the organism from the 
candidate nucleic acid is derived. An oligonucleotide 
for use in nucleic acid amplification may have about 10 

10 or fewer codons (e.g. 6, 7 or 8), i.e. be about 30 or 
fewer nucleotides in length (e.g. 18, 21 or 24). 
Generally specific primers are upwards of 14 nucleotides 
in length, but not more than 18-20. Those skilled in the 
art are well versed in the design of primers for use 

15 processes such as PCR. 

Accordingly, a further aspect of the present invention 
provides an oligonucleotide or nucleotide fragment of the 
one of the nucleotide sequence disclosed herein, or a 

20 complementary sequence, in particular for use in a method 
of obtaining and/or screening nucleic acid. The 
sequences referred to above may be modified by addition, 
substitution, insertion or deletion of one or more 
nucleotides, but preferably without abolition of ability 

25 to hybridise selectively with nucleic acid with the 
sequence shown herein, that is wherein the degree of 
sequence identity of the oligonucleotide or 
polynucleotide with one of the sequences given is 
sufficiently high. 

30 

In some preferred embodiments, oligonucleotides according 
to the present invention that are fragments of any of the 
nucleic acid sequences provided herein, or complementary 
sequences thereof, are at least about 10 nucleotides in 
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length, more preferably at least about 15 nucleotides in 
length, more preferably at least about 20 nucleotides in 
length. Such fragments themselves individually represent 
• aspects of the present invention. Fragments, and other 
5 oligonucleotides may be used as primers or probes as 
discussed but may also be generated (e.g. by PCR) in 
methods concerned with determining the presence of HDAC9 
nucleic acid in a test sample. 

10 HDAC9 Polypeptides 

The HDAC9 polypeptides disclosed herein, or fragments or 
active portions thereof, can be used as pharmaceuticals, 
in the developments of drugs, for further study into its 
properties and role in vivo and to screen for HDAC9 

15 inhibitors. Thus, a further aspect of the present 

invention provides a polypeptide which has the amino acid 
sequence shown in Figure 2, which may be in isolated 
and/or purified form, free or substantially free of 
material with which it is naturally assopiated, such as 

20 polypeptides other than HDAC9 . 

The present invention also includes active portions, 
domains and fragments (including domains) of the HDAC9 
•polypeptides of the invention. 

25 

An "active portion" of'HDAC9 polypeptide means a peptide 
which is less than said full length HDAC9 polypeptide, 
but which retains at least some of its essential 
biological activity, e.g. as a deacetylase. Active 
30 portions may be great than 100 amino acids, more 

preferably greater than 200 amino acids, more preferably 
greater than 300 amino acids and most preferably greater 
than 4 00 amino acids in length. Preferred active portion 
include all or part of the 61 C-terminal amino acids 
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shown in Figure 2. 

A "fragment" of the HDAC9 polypeptide means a stretch of 
amino acid residues of at least 5 contiguous amino acids 
5 from the sequences set out as Figure 2, or more 

■preferably at least 7 contiguous amino acids,, or more 
preferably at least 10 contiguous amino acids or more 
preferably at least 20 contiguous amino acids or more 
preferably at least 4 0 contiguous amino acids. Fragments 
10 of the HDAC9 polypeptide sequences may be useful as 

antigenic determinants or epitopes for raising antibodies 
to a portion of the HDAC9 amino acid sequence which also 
forms part of the present invention. For instance, 
fragments of HDAC9 can act as sequestrators or 
15 competitive antagonists by interacting with other 

proteins, e.g. if they possess a protein interaction 
domain present in the full length HDAC9 sequence. 

Polypeptides which are amino acid sequence variants are 
also provided by the present invention. A "sequence 
variant" of the HDAC9 polypeptide, or an active portion 
or fragment thereof, means a polypeptide modified by 
varying the amino acid sequence of the protein, e.g. by 
manipulation of the nucleic acid encoding the protein or 
by altering the protein itself. Such sequence variants 
of the natural amino acid sequence may involve insertion, 
addition, deletion or substitution of one, two, three, 
five, ten, twenty or more amino acids. 

Preferred polypeptides have HDAC9 biological activity, 
that is, they are capable of removing an acetyl group 
from a substrate such as a histone (e.g. histone H3 or 
H4). HDAC9 and isoforms of the protein may have other 
activities and one potential role is linked with the 
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Aexon7 isoform. It has been suggested that HDACs can 
function as molecular reservoirs within the cell as well 
as deacetylating histones, and also as transcription 
factors. We have observed that HDAC9 interacts in vitro 
5 and co-localizes in vivo with a number of transcriptional 
repressors, such as TEL, PLZF and BCL-6, whose function 
has been implicated in the pathogenesis of hematological 
malignancies, as well as the co-repressors mSIN3A/B and 
. NCoR. When HDAC9 or HDAC9Aexon7 are co-transf ected into 

10 COS cells they have the ability to delocalize BCL-6 and 
PLZF. NCoR is also delocalised. The interesting aspect 
of these interactions is that HDAC9 is located in the 
nucleus whilst HDAC9Aexon7 is cytoplasmic. Additionally, 
the HDAC9ACD/hMITR isoform displays a microspeckled 

15 distribution within the nucleus which is distinct to that 
of HDAC9. 

Preferably, a polypeptide which is an amino acid sequence 
variant of the amino acid sequence shown in Figure 2 at 

20 'least 80% sequence identity, more preferably at least 90% 
sequence identity, more preferably at least 95% sequence 
identity, more preferably at least 97% sequence identity 
more preferably at least 98% sequence identity and most 
preferably at least 99% sequence identity to the 

25 sequences of Figure 2. 

The skilled person can readily make sequence comparisons 
and determine identity using techniques well known in the 
art, e.g. using the GCG program which is available from 
30 Genetics Computer Group, Oxford Molecular Group, Madison, 
Wisconsin, USA, Version 9.1. Particular amino acid 
sequence variants may differ from those shown in SEQ ID 
Nos: 2 or 4 by insertion, addition, substitution or 
deletion of 1 amino acid, 2, 3, 4, 5-10, 10-20 20-30, 30- 
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50, 50-100, 100-150, or more than 150 amino acids. 

"Percent (%) amino acid sequence identity'' with respect 
to the HDAC9 polypeptide sequences identified herein is 
5 defined as the percentage of amino acid residues in a 

candidate sequence that are identical with the amino acid 
residues in the HDAC9 sequence, after aligning the 
sequences and introducing gaps, if necessary, to achieve 
the maximum percent sequence identity, and not 
10 considering any conservative substitutions as part of the 
sequence identity. The % identity values can be 
generated by WU-BLAST-2 which was obtained from [Altschul 
et al, Methods in Enzymology, 266:460-480 (1996); 
http : //blast . wustl/edu/blast /README . html] . WU-BLAST-2 

15 uses several search parameters, most of which are set to 
the default values. The adjustable parameters are set 
with the following values: overlap span=l, overlap 
fraction^ 0.125, word threshold (T) =11 . The HSPS and 
HSPS2 parameters are dynamic values and are established 

20 by the program itself depending upon the composition of 

the particular sequence and composition of the particular 
database against which the sequence of interest is being 
-searched; however, the values may be adjusted to increase 
sensitivity. A % amino acid sequence identity value is 

25 determined by the number of matching identical residues 
divided by the total number of residues of the "longer" 
sequence in the aligned region. The "longer" sequence is 
the one having the most actual residues in the aligned 
region (gaps introduced by WU-Blast-2 to maximize the 

30 alignment score are ignored) . 

Similarly, "percent (%) nucleic acid sequence identity" 
with respect to the coding sequence of the HDAC9 
polypeptides identified herein is defined as the 
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percentage of nucleotide residues in a candidate sequence 
that are identical with the nucleotide residues in the 
HDAC9 coding sequence as provided in Figure 2. The 
identity values used herein were generated by the BLASTN 
5 .module of WU BLAST-2 set to the default parameters, with 
overlap span and overlap fraction set to 1 and 0.125, 
respectively. 

A polypeptide according to the present invention may be 
10 isolated and/or purified (e.g. using an antibody) for 
instance after production by expression from encoding 
nucleic acid. Polypeptides according to the present 
invention may also be generated wholly or partly by 
chemical synthesis. The isolated and/or purified 
15 polypeptide may be used in formulation of a composition, 
which may include at least one additional component. 

The HDAC9 polypeptides can also be linked to a coupling 
partner, e.g. an effector molecule, a label, a drug, a 
20 toxin and/or a carrier or transport molecule. Techniques 
for coupling the peptides of the invention to both 
peptidyl and non-peptidyl coupling partners are well 
known in the art. 

25 Antibodies Capable of Binding HDAC9 Polypeptides 

A further important use of the HDAC9 polypeptides is in 
raising antibodies that have the property of specifically 
binding to the HDAC9 polypeptides or fragments thereof. 
The techniques for producing monoclonal antibodies to 
30 HDAC9 protein are well established in the art. Preferred 
antibodies are capable of specifically binding to an 
epitope of HDAC9 located or partially located between 
amino acids 1009 and 1069 of the amino acid sequence 
shown in Figure 2. The anti-HDAC9 antibodies of the 
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present invention may be specific in the sense of being 
able to distinguish between the polypeptide it is able to 
bind and other human polypeptides for which it has no or 
substantially no binding affinity (e.g. a binding 
5 affinity of about lOOOx worse) . Specific antibodies bind 
an epitope on the molecule which is either not present or 
is not accessible on other molecules. Other preferred 
antibodies include those capable of substantially 
neutralising a HDAC9 biological activity and especially 
10 biological activity as a histone deacetylase. 

Antibodies may be obtained using techniques which are 
standard in the art. Methods of producing antibodies 
include immunising a mammal (e.g. mouse, rat, rabbit, 
15 horse, goat, sheep or monkey) with the protein or a 
fragment thereof. Antibodies may be obtained from 
immunised animals using any of a variety of techniques 
known in the art, and screened, preferably using binding 
of antibody to antigen of interest. For instance, 
20 Western blotting techniques or immunoprecipitation may be 
used (Armitage et al, Nature, 357:80-82, 1992). 
Isolation of antibodies and/or antibody-producing cells 
from an animal may be accompanied by a step of 
sacrificing the animal. As an alternative or supplement 
25 to immunising a mammal with a peptide, an antibody 
specific for a protein may be obtained from a 
recombinantly produced library of expressed 
immunoglobulin variable domains, e.g. using lambda 
bacteriophage or filamentous bacteriophage which display 
30 functional immunoglobulin binding domains on their 

surfaces; for instance see WO92/01047. The library may 
be naive, that is constructed from sequences obtained 
from an organism which has not been immunised with any of 
the proteins (or fragments) , or may be one constructed 
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' using sequences obtained from an organism which has been 
exposed to the antigen of interest. 

Antibodies according to the present invention may be 
modified in a number of ways that are well known in the 
5 art. Indeed the term "antibody" should be construed as 
covering any binding substance having a binding domain 
with the required specificity. Thus the invention covers 
antibody fragments, derivatives, functional equivalents 
and homologues of antibodies, including synthetic 

10 molecules and molecules whose shape mimics that of an 
antibody enabling it to bind an antigen or epitope. 
Humanised antibodies in which CDRs from a non-human 
source are grafted onto human framework regions, 
typically with the alteration of some of the framework 

15 amino acid residues, to provide antibodies which are less 
immunogenic than the parent non-human antibodies, are 
also included within the present invention. 

A hybridoma producing a monoclonal antibody according to 
20 the present invention may be subject to genetic mutation 
or other changes. It will further be understood by those 
- skilled in the art that a monoclonal antibody can be 
subjected to the techniques of recombinant DNA technology 
to produce other antibodies or chimeric molecules which 
25 retain the specificity of the original antibody. Such 
techniques may involve introducing DMA encoding the 
immunoglobulin variable region, or the complementarity 
determining regions (CDRs), of an antibody to the 
constant regions, or constant regions plus framework 
30 regions, of a different immunoglobulin. See, for 

instance, EP 0 184 187 A, GB 2 188 638 A or EP 0 239 400 
A. Cloning and expression of chimeric antibodies are 
described in EP 0 120 694 A and EP 0 125 023 A. 
In a further aspect the present invention provides a 
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method of making antibodies, the method comprising 
employing a HDAC9 polypeptide or a fragment thereof as a: 
immunogen. The present invention also provides a method 
of screening for antibodies which are capable of 
specifically binding HDAC9 polypeptide, the method 
comprising contacting a HDAC9 polypeptide with one or 
more candidate antibodies and detecting whether binding 
occurs . 
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10 Preferred antibodies according to the invention are 

isolated, in the sense of being free from contaminants 
such as antibodies able to bind other polypeptides and/or 
free of serum components. Monoclonal antibodies are 
preferred for some purposes, though polyclonal antibodies 

15 are within the scope of the present invention. 

Hybridomas capable of producing antibody with desired 
binding characteristics are within the scope of the 
present invention, as are host cells, eukaryotic or 
prokaryotic, containing nucleic acid encoding antibodies 
(including antibody fragments) and capable of their 
expression. The invention also provides methods of 
production of the antibodies including growing a cell 
capable of producing the antibody under conditions in 
which the antibody is produced, and preferably secreted. 

The reactivities of antibodies on a sample may be 
determined by any appropriate means. Tagging with 
individual reporter molecules is one possibility. The 
reporter molecules may directly or indirectly generate 
detectable, and preferably measurable, signals. The 
linkage of reporter molecules may be directly or 
indirectly, covalently, e.g. via a peptide bond or non- 
covalently. Linkage via a peptide bond may be as a 
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result of recombinant expression of a gene fusion 
encoding antibody and reporter molecule. One favoured 
mode is by covalent linkage of each antibody with an 
individual f luorochrome, phosphor or laser exciting dye 
5 with spectrally isolated absorption or emission 
characteristics. Suitable f luorochromes include 
.fluorescein, rhodamine, phycoerythrin and Texas Red. 
Suitable chromogenic dyes include diaminobenzidine . 

10 Other reporters include macromolecular colloidal 

particles or particulate material such as latex beads 
that are coloured, magnetic or paramagnetic, and 
biologically or chemically active agents that can 
directly or indirectly cause detectable signals to be 

15 visually observed, electronically detected or otherwise 
recorded. These molecules may be enzymes which catalyse 
reactions that develop or change colours or cause changes 
in electrical properties , for example. They may be 
molecularly excitable, such that electronic transitions 

20 between energy states result in characteristic spectral 
absorptions or emissions. They may include chemical 
entities used in conjunction with biosensors. 
Biotin/avidin or biotin/streptavidin and alkaline 
phosphatase detection systems may be employed. 

25 

Antibodies according to the present invention may be used 
in screening for the presence of a polypeptide, for 
example in a test sample containing cells or cell lysate 
as discussed, and may be used in purifying and/or 
30 isolating a polypeptide according to the present 

invention, for instance following production of the 
polypeptide by expression from encoding nucleic acid. 
Antibodies may modulate the activity of the polypeptide 
to which they bind and so, if that polypeptide has a 
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deleterious effect in an individual, may be useful in a 
therapeutic context (which may include prophylaxis) . 

Screening for Mimetic Substances 

5 The present invention further relates to the use of a 
histone deacetylase 9 polypeptide or nucleic acid 
molecule for screening for candidate compounds which (a) 
share a histone deacetylase 9 biological activity or (b) 
bind to the histone deacetylase 9 polypeptide or (c) 
10 inhibit a biological activity of a histone deacetylase 9 
polypeptide. 

It is well known that pharmaceutical research leading to 
the identification of a new drug may involve the 

15 screening of very large numbers of candidate substances, 
both before and even after a lead compound has been 
found. This is one factor which makes pharmaceutical 
research very expensive and time-consuming. Means for 
assisting in the screening process can have considerable 

20 commercial importance and utility. 

By way of example, screening can be carried out to find 

peptidyl or non-peptidyl mimetics or inhibitors of the 

HDAC9 polypeptides to develop as lead compounds in 
25 pharmaceutical research. 

In this aspect of the invention, preferably the activity 
of the HDAC9 polypeptide is the activity of removing an 
acetyl groups from the substrate. Conveniently, the 
30 progress of this reaction can be assessed by labelling 

the substrate with a detectable label (e.g. a radioactive 
label) and measuring the amount of label released from 
the substrate by the action of the HDAC9 polypeptide, 
e.g. in a scintillation proximity assay. Preferably, the 
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method is for screening for modulators of HDAC9 which may 
be further tested for use a therapeutic, especially for 
the treatment of cancer. 

5 A method of screening for a substance which modulates 
activity of a polypeptide may include contacting one or 
more test substances with a HDAC9 in a suitable reaction 
medium, testing the activity of the treated polypeptide 
and comparing that activity with the activity of the 
10 polypeptide in comparable reaction medium untreated with 
the test substance or substances. A difference in 
activity between the treated and untreated polypeptides 
is indicative of a modulating effect of the relevant test 
substance or substances. 

15 

Combinatorial library technology provides an efficient 
way of testing a potentially vast number of different 
substances for ability to modulate activity of a 
polypeptide. Such libraries and their use are known in 
20 the art. The use of peptide libraries is preferred. 

Prior to or as well as being screened for modulation of 
activity, test substances may be screened for ability to 
.interact with the polypeptide, e.g. in a yeast two-hybrid 

25 system (which requires that both the polypeptide and the 
test substance can be expressed in yeast from encoding 
nucleic acid) . This may be used as a coarse screen prior 
to testing a substance for actual ability to modulate 
activity of the polypeptide. Alternatively, the screen 

30 could be used to screen test substances for binding to a 
HDAC9 specific binding partner, to find mimetics of the 
HDAC9 polypeptide, e.g. for testing as therapeutics. 

In one embodiment, the present invention provides a 
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method of identifying a compound which is capable of 
modulating an activity of a histone deacetylase 9 
polypeptide, the method comprising: 

(a) contacting at least one candidate compound with 
5 a histone deacetylase 9 (HDAC9) polypeptide as defined 

.herein under conditions in which the candidate compound 
and HDAC9 polypeptide are capable of interacting; 

(b) determining in an assay for a HDAC9 activity 
whether the candidate compound modulates the activity; 

10 and 

(c) selecting a candidate compound which modulates 
an activity of the HDAC9 polypeptide. 
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In a preferred embodiment, the present invention provides 
a method of identifying a compound which is capable of 
inhibiting histone deacetylase 9 (HDAC9) polypeptide, the 
method comprising: 

(a) contacting at least one candidate compound and a 
HDAC9 polypeptide as defined herein in the presence of a 
substrate for HDAC 9 under conditions in which the 
candidate compound, HDAC 9 polypeptide and HDAC 9 substrate 
are capable of interacting; 

(b) determining whether the candidate compound 
inhibits the activity of the HDAC 9 polypeptide in 

25 reacting with the substrate; and, 

(c) selecting the candidate compound which inhibits 
the activity of the HDAC 9 polypeptide on the substrate. 



30 



Following identification of a candidate compound which 
modulates or inhibits HDAC 9 activity, the substance may 
be investigated further. Furthermore, it may be 
manufactured and/or used in preparation, i.e. manufacture 
or formulation, of a composition such as a medicament, 
pharmaceutical composition or drug. 
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Diagnostic Methods 

Recent studies have implicated abnormal histone 
deacetylase function with a number of human cancers. 
5 Accordingly, the present invention also provides the use 
of HDAC9 as a diagnostic marker for cancer, e.g. by 
correlating this level with the amount of the HDAC9 
polypeptide, an isoform thereof, or HDAC9 nucleic acid 
present in a control. 

10 

Under normal circumstances, the expression of HDAC9 
isoforms which contain the catalytic domain appears to be 
tightly regulated, with the protein only found at high 
levels in adult and foetal brain, and to a lesser extent 

15 skeletal muscle, testes, bone marrow, thymus, spleen, 

CD14 +Ve , and CD19 +Ve cells. However, inspection of the RT 
PCR data reveals that HDAC9 is specifically expressed in 
TEL-AML1 positive and negative pre-B cell acute 
lymphoblastic leukaemia, and B cell lymphoma cell lines 

20 .and patient samples. HDAC9 is not expressed in various 

acute myeloid leukaemia cell lines, with the exception of 
acute monocytic leukaemia, possibly reflecting a common 
progenitor for B cell and monocytic lineages. HDAC9 is 
also found at lower levels in the erythroleukemia cell 

25 line, HEL. The cell line which overexpress HDAC9 also 
express the AExon7 and AExonl2 isoforms at higher levels 
and in one the of CLL patient samples which have been 
stimulated with SAC/IL2, there is a clear change in the 
relative expression of full length and AExonl2 isoforms. 

30 There is evidence that the addition of HDAC inhibitors 
can alter the cellular distribution of HDACs presumably 
through interfering with protein-protein interactions as 
well blocking the catalytic site and this potential of 
HDAC 9 in recruiting and modifying of other proteins, as 
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well histone deacetylation. Taking this into 
consideration, the AExon7 may prove to be very important 
in the pathogenesis of some leukaemias. 



5 In this context, there are a number of methods known in 
the art for analysing samples from individuals to 
determine the presence of HDAC9 nucleic acid or 
polypeptide. The assays may determine the presence or 
amount of HDAC9 nucleic acid or polypeptide in a sample 

10 from a patient, and whether the nucleic acid or 

polypeptide is full length or has a HDAC9 biological 
activity. Examples of biological samples include blood, 
plasma, serum, tissue samples, tumour samples, saliva and 
urine. The purpose of such analysis may be used for 

15 diagnosis or prognosis, to assist a physician in 

determining the severity or likely course of the conditioi 
and/or to optimise treatment of it. 



Exemplary approaches for detecting HDAC9 nucleic acid or 
20 polypeptides include: 

(a) determining the presence or amount of HDAC9 

polypeptide in a sample from a patient, by measuring an 

activity of the HDAC9 polypeptide or its presence in a 

binding assay; or, 
25 < b ) determining the presence of HDAC9 nucleic acid 

using a probe capable of hybridising to the HDAC9 nucleic 

acid; 

(c) using PCR involving one or more primers based on 
a HDAC9 nucleic acid sequence to determine whether the 
30 HDAC9 transcript is present in a sample from a patient. 

In one embodiment, the method comprises the steps of: 

(a) contacting a sample obtained from the patient 
•with a solid support having immobilised thereon binding 
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agent having binding sites specific for HDAC9 polypeptide 
or HDAC9 nucleic acid; 

(b) contacting the solid support with a labelled 
developing agent capable of binding to unoccupied binding 

5 sites, bound HDAC9 polypeptide or nucleic acid or occupied 
binding sites; and, 

(c) detecting the label of the developing agent 
specifically binding in step (b) to obtain a value 
representative of the presence or amount of the HDAC9 

10 polypeptide or nucleic acid in the sample. 

The binding agent preferably is a specific binding agent 
and has one or more binding sites capable of specifically 
binding to HDAC9 or nucleic acid in preference to other 
15 molecules. Conveniently, the binding agent is immobilised 
•on solid support, e.g. at a defined location, to make it 
easy to manipulate during the assay. 

Examples of specific binding pairs are antigens and 
20 antibodies, molecules and receptors and complementary 

nucleotide sequences. The skilled person will be able to 
think of many other examples and they do not need to be 
listed here. Further, the term "specific binding pair" is 
also applicable where either or both of the specific 
25 binding member and the binding partner comprise a part of 
a larger molecule. In embodiments in which the specific 
binding pair are nucleic acid sequences, they will be of a 
length to hybridise to each other under the conditions of 
the assay, preferably greater than 10 nucleotides long, 
30 more preferably greater than 15 or 20 nucleotides long. 

There are various methods for determining the presence or 
absence in a test sample of a particular nucleic acid 
sequence, such as the sequence shown in Figure 2. 
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Exemplary tests include nucleotide sequencing, 
hybridisation using nucleic acid immobilized on chips, 
molecular phenotype tests, protein truncation tests (PTT), 
single-strand conformation polymorphism (SSCP) tests, 
5 mismatch cleavage detection and denaturing gradient gel 
electrophoresis (DGGE) . These techniques and their 
advantages and disadvantages are reviewed in Nature 
Biotechnology, 15:422-426, 1997. 

10 Pharmaceutical Compositions 

The present invention disclose the use of HDAC9 or 
inhibitors thereof for formulation in pharmaceutical 
compositions, and especially compositions for the 
treatment of cancer, and more especially leukaemias such 
15 as TEL-AML1 positive and negative pre-B cell acute 

lymphoblastic leukaemia, and B cell lymphoma. These 
compositions may comprise, in addition to one of the above 
substances, a pharmaceutical^ acceptable excipient, 
.carrier, buffer, stabiliser or other materials well known 
20 to those skilled in the art. Such materials should be 
non-toxic and should not interfere with the efficacy of 
the active ingredient. The precise nature of the carrier 
or other material may depend on the route of 
administration, e.g. oral, intravenous, cutaneous or 
25 subcutaneous, nasal, intramuscular, intraperitoneal 
routes. 



Pharmaceutical compositions for oral administration may be 
in tablet, capsule, powder or liquid form. A tablet may 
include a solid carrier such as gelatin or an adjuvant. 
Liquid pharmaceutical compositions generally include a 
liquid carrier such as water, petroleum, animal or 
vegetable oils, mineral oil or synthetic oil. 
Physiological saline solution, dextrose or other 
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saccharide solution or glycols such as ethylene glycol, 
propylene glycol or polyethylene glycol may be included. 

For intravenous, cutaneous or subcutaneous injection, or 
5 injection at the site of affliction, the active ingredient 
will be in the form of a parenterally acceptable aqueous 
solution which is pyrogen-free and has suitable pH, 
isotonicity and stability. Those of relevant skill in the 
art are well able to prepare suitable solutions using, for 
10 example, isotonic vehicles such as Sodium Chloride 
Injection, Ringer's Injection, Lactated Ringer's 
Injection. Preservatives, stabilisers, buffers, 
antioxidants and/or other additives may be included, as 
required. 

Whether it is a polypeptide, antibody, peptide, nucleic 
acid molecule, small molecule or other pharmaceutically 
useful compound according to the present invention that 
is to be given to an individual, administration is 
preferably in a "prophylactically effective amount" or a 
"therapeutically effective amount" (as the case may be, 
although prophylaxis may be considered therapy) , this 
being sufficient to show benefit to the individual. The 
actual amount administered, and rate and time-course of 
administration, will depend on the nature and severity of 
what is being treated. Prescription of treatment, e.g. 
decisions on dosage etc, is within the responsibility of 
general practitioners and other medical doctors, and 
typically takes account of the disorder to be treated, 
the condition of the individual patient, the site of 
delivery, the method of administration and other factors 
known to practitioners. Examples of the techniques and 
protocols mentioned above can be found in Remington's 
Pharmaceutical Sciences, 20th Edition, 2000, pub. 
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10 



Lippincott, Williams & Wilkins. A composition may be 
administered alone or in combination with other 
treatments, either simultaneously or sequentially, 
dependent upon the condition to be treated. 

Experimental 
Cloning of HP AC 9 

The National Institute for Biotechnology Information 
(NCBI) high throughput genomic sequence database (htgs) 
was searched with the amino acid sequences corresponding 
to the deacetylase domain of HDAC5 in an effort to 
identify novel histone deacetylase genes. Several DNA 
sequences encoding peptides with significant homology to 
HDAC5 were found on a human BAC clone RP11-8I15 
15 (accession no. AC016186) which contains 70 unordered 
contigs and maps to chromosome 18. When the Genbank 
nucleotide database was searched with a composite of the 
novel sequences showing homology to HDAC5, it aligned 
with BAC clone CTB-13P7 (accession no. AC002088) which 
20 mapped to chromosome 7pl5-p21 (Table 1), indicating the 
presence of a pseudogene or incorrect entry into the htgs 
database. A search of the literature revealed that 7pl5- 
p21 had already been identified as containing a potential 
HDAC open reading frame and that a gene encoding a 
25 protein called HDRP/MITR which shares 50% identity with 
• the N-terminus of HDACs 4 and 5, had been cloned and also 
mapped to this genomic region. Since a search of the 
expressed sequence tag database (dbEST) revealed that 
cDNA corresponding to a partial HDAC domain had been 
isolated from germinal centre B cells (accession no. 
AA287983), several hematopoietic cell lines were analysed 
by reverse transcription (RT)-PCR with oligonucleotides 
corresponding to the HDAC domain and HDRP/MITR. It was 
found that not only was it possible to amplify bands from 
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the HDAC domain itself but also from the cDNA of 
HDRP/MITR through to the HDAC domain and this resulted in 
the generation of around 2700 bp of sequence. 3' RACE 
failed to yield the remaining sequence so an attempt was 

5 made to try to find overlapping ESTs in order to 'walk 1 
along the cDNA (Figure 1) and confirm any matching 
sequence against the genomic clones. During this 
" investigation, it became apparent that the sequence 
corresponding to the final 5 exons of HDAC 9 (accession 

0 no. RP5-1194E15 and GS1-465N13) had been submitted in the 
antisense direction in relation to the rest of the gene 
(see Table 1) . As these clones accounted for almost 
160kb of DNA (out of a total of 500kb for the entire 
gene), search algorithms had not previously identified 

5 homology between the EST and genomic sequences. Clones 
spanning the HDAC 9 gene are listed in Table 2. When the 
database searches * were repeated with the last two genomic 
clones in the correct orientation, an alignment was 
obtained, which showed significant homology with the 

0 sequence for HDACs 4 and 5. This information permitted 
the cloning of the entire open reading frame of HDAC 9 
(Figure 2) from Marathon-Ready Human Brain cDNA 
(Clontech) using the sense primer 9F1 5'- 

ATGCACAGTATGATCAGCTCA-3 1 and the antisense primer 9R1 5'- 
5 GTCACACACAGGAAATATCAG-3 1 . HDAC9ACD/HDRP/MITR was cloned 
from Marathon-Ready Human Brain cDNA (Clontech) using the 
sense primer 9F1 and the antisense primer 9ACDR1 5 f - 
TCAGATAATGACTTTAATTACAAAT-3 1 (see Figure 2 for the cDNA 
. sequence and translation) . HDAC9AExon7 and HDAC9AExonl2 
0 were cloned from the acute monocytic leukaemia cell line 
MONO-MAC-6 using the sense primer 9F1 and the antisense 
primer 9R3 5 1 -TCTCTAATCCATCCATGCCAA-3 1 . HDAC9AExonl5 was 
cloned from the acute Pre-B ALL cell line REH using the 
sense primer 9F2 5 1 -AGGCTGCTTTTATGCAACAG-3 1 and the 
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antisense primer 9R2 5 ' -CTGAATGCTTCAAGGTACTCA-3 ' . See 
Figure 3 for a schematic detailing the positions of the 
above primers. PCR products were cloned into pCRII 
(Invitrogen) and sequenced using BigDye (Perkin Elmer) . 

5 

Analysis of HDAC9 sequence 

The full-length product of the HDAC9 gene comprises 1069 
amino acids as shown in Figures 2 (see Figure 3 for 
schematic) , and is encoded by exons 2-26 (exon 1 is 
10 untranslated).. The 26 exons, which form the HDAC9 cDNA, 
■span 500 kb of genomic sequence on chromosome 7pl5-p21. 
HDAC9ACD, also referred to as HDRP/MITR is 593 amino acids 
in length and contains 16 residues of unique sequence 
encoded by a region of exon 12 which is 3* to the splice 
15 donor site used to generate the HDAC9 ORF. There are 

several exon deletions which may occur that preserve the 
open reading frame of HDAC9 (Table 2) and two have been 
identified and cloned. HDAC9Aexon7 is 1025 amino acids 
long and contains an Ala to Glu substitution at position 
20 222 as the result of the deletion of exon 7. This isoform 
lacks two serines (S223 and S253) which when 
phosphorylated have been implicated in 14-3-3 protein 
dependent shuttling of HDAC4 and 5 from the nucleus to the 
cytoplasm, and also a tripartite nuclear localization 
25 signal. HDAC9Aexonl2 is 981 amino acids long and lacks 

exon 12 sequences encoding a leucine zipper motif that may 
mediate interactions of HDAC9 with other proteins. 

An isoform, which possesses neither exon 7 nor exon 12, 
30 has also been cloned. HDAC9Aexonl5 is 1027 amino acids 
long and lacks a region within the catalytic domain 
adjacent to the active site. This isoform does not 
naturally conserve the ORF of HDAC9 and appears to have 
undergone RNA editing as shown below. 
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exonl4 exonl6 
HDAC9 5'-aaa tgt gag -exonl5- gt gat gac-3' 

HDAC9Aexonl5 5 T -aaa tgt gac gac-3 1 

K C D D 

5 Additionally, exon 4 may be deleted, which introduces an 
in-frame stop codon and results in a truncated protein of 
95 amino acids. Exon 4 may also be alternatively spliced 
at either end as indicated in Table 2. 



10 HDAC9 possesses histone deacetylase activity and 
represses basal transcription 

In order to determine whether HDAC9 possesses histone 
deacetylase activity, an in-vitro assay was performed 
using anti-FLAG immunoprecipitated HDAC9. As shown in 
15 Figure 5A, HDAC9 contains about 25% activity of HDAC1 and 
50% activity of HDACs 4 and 5. Higher activities of HDAC9 
have been detected towards peptide substrates derived from 
histone H3 than from histone H4 (Figure 7A and B) . 

20 It has been previously established that HDACs repress 
transcription when tethered to DNA as Gal4 fusion 
proteins. This effect is also observed with HDAC9, 
HDAC9ACD and HDAC9CD alone (Figure 5B) . A GAL4 uas x5-TK- 
Luciferase reporter gene was transiently transfected into 

25 . 293T cells together with the expression vectors for the 

indicated GAL 4 fusion proteins. The repressive effects of 
various GAL4DBD-HDAC9 fusions were partially relieved by 
the addition of the HDAC inhibitor Trichostatin A, 
including that of HDAC9ACD. Although HDAC9ACD lacks an 

30 HDAC domain it has been shown to bind other HDACs directly 
and via co-repressors, and this may be reflected in its: 
ability to repress in a TSA sensitive manner. These 
results indicate that HDAC 9 is able to use Histone H4 as a 
substrate and functions, at least in part, as a component 
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of the repression machinery of the cell. 

Expression patterns of HDAC9 in normal human tissue and 
cancer cells 

5 Under normal circumstances, the expression of HDAC9 

isoforms which contain the catalytic domain appears to be 
■ tightly regulated, with the protein only found at high 
levels in adult and foetal brain, and to a lesser extent 
skeletal muscle, testes, bone marrow, thymus, spleen, 

10 CD14 +ve , and CD19 +ve cells (Figure 8). However, inspection 
of RT-PCR data reveals that HDAC9 is specifically 
expressed in TEL-AML1 positive and negative pre-B cell 
acute lymphoblastic leukaemia samples, and B cell lymphoma 
cell lines and patient samples (Figure 8A and B) . HDAC9 

15 is not expressed in various acute myeloid leukaemia cell 
lines, with the exception of acute monocytic leukaemia, 
possibly reflecting a common progenitor for B cell and 
monocytic lineages. HDAC9 is also found at lower levels in 
the erythroleukemia cell line, HEL. The leukaemic cell 

20 lines, which over-express HDAC9, also over-express the 
isoforms lacking exon7 and exonl2 relative to normal 
levels (see Figures 8B and C) . In one the CLL patient 
sample that has been stimulated with SAC/IL2, there is a 
clear change in the relative expression of full length and 

25 AExonl2 isoforms (Figure 8C) . There is evidence that the 
addition of HDAC inhibitors can alter the cellular 
distribution of HDACs presumably through interfering with 
protein-protein interactions as well blocking the 
catalytic site with respect to HDAC9, such findings should 

30 be considered in the context of various isoforms and their 
function. HDAC 9 isoforms lacking functional domains 
encoded by exons 7 or 12 may prove to be an important 
factor in the pathogenesis of some hematalogical 
malignancies. 
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In-vltro and ±n-xrlvo interactions of HDAC9 

Given the expression patterns of the HDAC9 gene in the 
normal and malignant cells deriving from the B cell 
5 lineage, we examined whether its products could interact 
with any of the proteins whose function has been 
implicated in B-cell malignancies (Fig. 9) . As expected, 
HDAC9 was found to interact with Diffuse Large Cell 
Lymphoma and pre-B cell acute lymphoblastic leukaemia 

10 associated transcriptional repressors BCL-6 and TEL, 

respectively. HDAC9 also interacted with PLZF, HDAC 4 and 
to a lesser degree class I HDACs 1 and 3. In addition 
HDAC 9 was found to interact with the nuclear receptor co- 
mpressors mSin3A and B and N-CoR, whose activities have 

15 been implicated in the mechanism underlying pathogeneses 
of several human cancers. 

Fig. 10 shows the cellular localization of various HDAC9 
isoforms when visualized with anti-FLAG antibody and an 

20 antibody to the N-terminal epitope present in all of the 
• HDAC 9 isoforms. Note that HDAC9Aexon7 is completely 
excluded from the nucleus. To further analyse the 
interactions of HDAC 9 isoforms in vivo, we performed the 
co-immunofluorescence assays (Fig. 11-13) . When HDAC 9 was 

25 co-expressed with BCL-6, PLZF and N-CoR, it was shown to 
colocalize with them in-vivo- and to influence their wild 
type localization towards a pattern seen for a given HDAC 9 
isoform. These data indicated those in vitro interactions 
between the HDAC 9 and a number of other proteins (see 

30 above and Fig. 9) can also be observed in vivo. For 

example, when HDAC9Aexon7 was co-expressed with a given 
interaction partner there was a dramatic change in its 
cellular localization, with HDAC9Aexon7 recruiting it to 
the cytoplasm (see Fig. 10). 
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The references mentioned herein are all expressly 
incorporated by reference. 
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Table 1 



Human HDAC9 exon/intron splice junctions 



Exon 


Frame 


Size 


5' Splice Donor 


Intron 


Size 


o bpuce iiccepLor 


1 




109 


TCTAAGCCAG£tttaattggtt 


1 


407 


gtttttcct cagA l gggg I GGC 


2 


1 


63 


ATCAGCTCAGgtaagatcctct 


2 


88956 


ctqqttctttagTGGATGTGAA 


3 


2 


242 


GCATATCAAGgtagcaaatgct 


3a 


4812 


t ct t ct cgcaagTTGCAACAGG 


3 


2 






3b 




aagttgcaacagGAACTTCTAG 


4 


1 


142 


GGACGAGAAAc[taagaggcacc 


4c 






4 


1 




AGGCACCAGGgtaaacgatgga 


4d 


1014 


tgtgtatttcagGGGCAGTGGC 


5 


2 


127 


TCTGGTAC ACc[t a t g 1 1 ca g t g 


5 


2265 


tgtcttttctagGGCTGCCCAC 


6 


3 


122 


CGAAAAACTGcjtaagttggttt 


6 


35320 


ctcaatccccagCCTCTGAGCC 


§ 


2 


132 


GAGGTGACAG£ta a 1 1 g ag ga c 


7 


5145 


aatatttttcagAATCCTCAGT 


8 


2 


116 


TCATGCCGAGgtaagaccctta 


8 


9928 


tttttttaacagCAAATGGTTT 


WW 

1 


1 


123 


CCAGCTCAATgtaagtcattgc 


9 


2991 


ttctcaacacagGCTTCGAATT 


10 


1 


214 


CTTGTAGCTGgtaattcattat 


10 


4 67 


tttttttttcagGTGGAGTTCC 


11 


2 


218 


CATGAACAAAgtaagcctccaa 


11 


17529 


actctcttctagCTGCTTTCGA 


B 


1 


264 


TATGCAACAGgtaataggcaaa 


12 


58741 


tcttggcaacagCCTTTCCTGG 


13 


1 


178 


TCTGCAACTGgtaggaatccct 


13 


21243 


c 1 1 gt c 1 1 a aagGAATTGCCTA 


14 


2 


134 


TAAATGTGAGgtaatccagaat 


14 


13018 


at 1 1 1 cttgcagCGAATTCAAG 


15 


1 


121 


ATACTCCTAGstctgtacgggc 


15 


4828 


cttactgtatagGTGATGACTC 


16 


2 


50 


TGGACTTGGGgtaagt acaagt 


16 


26189 


ctgtttgctcagGTGGACAGTG 


159 

kSa 


1 


108 


AGAGCTGAAGc[tgaggtccggg 


17 


35708 


ttgttttcacacrAATGGGTTTG 


18 


1 


56 


CCACAGCCATgtaagtaccagg 


18 


244 


tctattccgcagGGGGTTCTGC 


19 


3 


88 


TGTAGATCTG£tatgtattcct 


19 


5921 


atttccctgtasGATGTTCACC 


B 


1 


120 


CCCAAATGAGgttcggtttatt 


20 


313 


ttctcttcccagGTTGGAACAG 


21 


1 


98 


AAGCATTCAGsttggtacttct 


21 


38480 


tttactgtgcagGACCATCGTG 


22 


3 


119 


ACGGCAAAATgtaagtacctct 


22 


61212 


gtattatggtagGTTTTGGTCA 


23 


2 


134 


AGGAAATGAGgtaaaaaagtaa 


23 


18203 


ctattcttgcagCTGGAGCCAC 


24 


1 


85 


GAAATTCAAAgtatgtctttaa 


24 


21575 


tgtttttccta^GCAAGTATTG 


25 


2 


148 


AAGACAGCAGgtatgaatccaa 


25 


20069 


ttatttttacagAACTGCTGGT 


26 


3 


40 











Consensus splice donor and acceptor sequences between exons (uppercase) and inttons (lowercase) are 
underlined. Exons highlighted in |§| have the potential to be spliced out in-frame. A exon 7 and 12 
cDNAs have already been detected. The suffixes a, b, c and d refer to the alternative splice sites at etber 
end of exon 4. i.e. 

a b 

cacactctcatqtctttctcttctcqcaagTTGCAACagGAACTTCTAGCCATAAAACAGCAACAAGAACTCCTAGAAAA 
GGAGCAGAAACTGGAGCAGCAGAGGCAAGAACAGGAAGTAGAGAGGCATCGCAGAGAACAGCAGCTTCCTCCTCTCAGAG 
GCAAAGATAGAGGACGAGAAA2tAAGAGGCACCAGG2taaacgatggactctctttcctcatcgttagctgatcattatt 

c d 

Areas shown in italics at either end of Exon4 refer to alternatively splices regions of the exon that have 
been detected. 
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Table 2 

Genomic org anization of HDAC9 and its MITR isoform 
Exon Clone Accession* 



MITR/ HDAC9 5' 
1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12a 



CTA-317M2 



CTB-180O1 



HDAC9 3' 

12b 

13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

23 

24 

25 

26 



CTB-180O1 
CTB-13P7 



CTA-264L19 

ii 
ii 
tt 

n 

RP5-1194E15 

ii 

GS1-465N13 



AC002433 



AC002124 



AC002124 
AC002088 



AC002410 
ii 

i> 

» 
i> 

AC004994 

ii 

AC004744 



Location on BAC 



43921 -44029 

44434-44499 

133456-133699 

138520-138661 

139691-139817 

2364-2485 

37086-37937 

43083-43198 

53127-53249 

56241-56454 

56922-57139 

74669-77298 



74669-74932 

35119-35296 

56544-56677 

69696-69816 

74645-74694 

15451-15558 

51267-51322 

51567-51654 

57573-57692 

58006-58103 

79771-79643 

18434-18301 

85480-85390 

63816-63669 

43602-43430 



Relative position 



1-110 

517-580 

89536-89779 

94600-94742 

95771-95898 

98163-98283 

133605-133737 

138882-138998 

148926-149049 

152040-152254 

152721-152939 

170468-173098 



170468-170732 
231839-232017 
253260-253394 
266412-266533 
271361-271411 
297600-297708 
333416-333472 
333716-333801 
339722-339842 
340155-340253 
378733-378852 
440064-440198 
458401-458486 
480061-480209 
500278-500449 
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Claims : 

1. An isolated polypeptide having histone deacetylase 
activity, wherein the polypeptide has the amino acid 
sequence of the polypeptide in Figure 2. 

5 

2. An isolated polypeptide having histone deacetylase 
activity, wherein the polypeptide has at least 80% amino 
acid sequence identity with the polypeptide in Figure 2. 

10 3. The isolated polypeptide of claim 2, wherein the 

polypeptide has at least 90% amino acid sequence identity 
.with the polypeptide in Figure 2 and comprises all of 
part of the amino acid sequence as set out between amino 
acids 1009 to 1069 of the polypeptide of Figure 2. 

15 

4. An isolated polypeptide having histone deacetylase 
activity, wherein the polypeptide is encoded by a nucleic 
acid sequence capable of hybridising under stringent 
conditions to the nucleotide sequence of Figure 2, or a 

20 complement thereof, and wherein the nucleic acid sequence 
comprises a portion able to hybridise under stringent 
conditions to a nucleic acid sequence encoding amino 
acids 1009 to 1069 of the polypeptide of Figure 2 r or a 
complement thereof . 

25 

5. An isolated polypeptide which is variant, fragment 
or active portion of a polypeptide according to any one 
of the preceding claims, wherein said variant, fragment 
or active portion has histone deacetylase activity. 

30 

6. The isolated polypeptide according to claim 5, 
wherein the polypeptide comprises all of part of the 
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10 



amino acid sequence as set out between amino acids 1009 
to 1069 of the polypeptide sequence of Figure 2. 

7. The isolated polypeptide of claim 5 which is a 
variant in which one or more of the exons shown in Figure 
2 is deleted. 

8. The isolated polypeptide variant of claim 7 in which 
exon 7, exon 12 and/or exon 15 are deleted. 

9. The isolated polypeptide of any one of the preceding 
claims which is joined to a coupling partner. 

10. The isolated polypeptide of claim 9, wherein the 
15 coupling partner is an effector molecule, a label, a 

.drug, a toxin or a carrier or transport molecule. 

11. An isolated nucleic acid molecule encoding a 
polypeptide of any one of the preceding claims. 

20 

12. An isolated nucleic acid molecule having a sequence 
as shown in Figure 2 and which encodes a polypeptide 
having histone deacetylase activity. 

25 13. An expression vector comprising a nucleic acid, of 
claim 11 or claim 12, operably linked to control 
sequences to direct its expression. 

14. A host cell transformed with the expression vector 
30 of claim 13. 

15. A method of producing a histone deacetylase 
polypeptide of any one of claims 1 to 10, the method 
comprising culturing the host cells of claim 14 and 
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isolating the polypeptide thus produced. 

16. A composition comprising a histone deactylase 9 
polypeptide of any one of claims 1 to 10. 

5 

17. A composition comprising a nucleic acid molecule of 
claim 11 or claim 12. 

18. Use of a histone deacetylase 9 (HDAC9) polypeptide 
10 of any one of claims 1 to 10, or a nucleic acid molecule 

encoding the polypeptide, for screening for candidate 
compounds which (a) share a histone deacetylase 9 
biological activity or (b) bind to the histone 
deacetylase 9 polypeptide or (c) inhibit a biological 
15 activity of a histone deacetylase 9 polypeptide. 

19. The use of claim 18, wherein the biological activity 
of the HDAC9 polypeptide is activity as a histone 
deacetylase. 

20 

20. A method of identifying a compound which is capable 
of modulating an activity of a histone deacetylase 9 
polypeptide (HDAC9) of any one of claims 1 to 10, the 
method comprising: 

25 (a) contacting at least one candidate compound with 

the HDAC9 polypeptide under conditions in which the 
candidate compound and HDAC9 polypeptide are capable of 
interacting; 

(b) determining in an assay for a HDAC9 activity 
30 • whether the candidate compound modulates the activity; 

and, 

(c) selecting a candidate compound which modulates 
an activity of the HDAC9 polypeptide. 
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.21. The method of claim 20, wherein the modulating is 
binding of the candidate compound to the HDAC9 
polypeptide. 

5 22. The method of claim 20, wherein the modulating is' 
the candidate compound inhibiting an activity of the 
HDAC9 polypeptide. 

23. The method of claim 22, wherein the activity is 
10 histone deacetylase activity. 

24. The method of claim 23, which comprises contacting 
the HDAC9 polypeptide with a histone substrate in the 
presence or absence of the candidate compound and 

15 determining the amounts of acetylated and deacetylated 
histone substrate present after reaction with the HDAC9 
polypeptide . 

25. The method of claim 24, wherein the histone 
20 substrate is histone 3 or histone 4. 

26. The method of claim 24 or claim 25, wherein the 
amounts of acetylated and deacetylated histone substrate 
is determined using antibodies which are capable of 

25 specifically binding to acetylated or deacetylated 
histone substrate. 

27. The method of claim 24 or claim 25, wherein the 
histone substrate has a label which is releasable from 

30 the substrate by the HDAC9, so that the amounts of 
acetylated and deacetylated histone substrate are 
determined by measuring the amount of label released from 
the substrate by the action of the HDAC9 polypeptide. 
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28. The method of claim 27, wherein the label is a 
radioactive label or a fluorescent label. 

29. A method of identifying a compound which is capable 
5 of inhibiting histone deacetylase 9 (HDAC9) polypeptide 

of any one of claims 1 to 10, the method comprising: 

(a) contacting at least one candidate compound and 
the HDAC9 polypeptide in the presence of a substrate for 
HDAC9 under conditions in which the candidate compound, 

10 HDAC9 polypeptide and HDAC9 substrate are capable of 
interacting; 

(b) determining whether the candidate compound 
inhibits the activity of the HDAC9 polypeptide in 
reacting with the substrate; and, 

15 (c) selecting the candidate compound which inhibits 

the activity of the HDAC9 polypeptide on the substrate. 

30. The method of claim 29, wherein the method employs a 
cell based reporter assay comprising (a) cells which are 

20 capable of producing HDAC9 polypeptide fused to a nucleic 
acid binding domain and (b) a reporter construct 
comprising (i) a promoter having a nucleic acid sequence 
that can be bound by the nucleic acid binding domain, 
thereby to activate the promoter, and (ii) a reporter 

25 under the transcriptional control of the promoter. 

31. The method of claim 30, wherein the presence of 
active HDAC9 polypeptide causes deacetylation of core 
histones associated with the nucleic acid binding domain . 

30 of the reporter and down regulation of promoter activity. 

32. The method of claim 30 or claim 31, wherein the 
method is employed to identify candidate compounds that 
inhibit HDAC9 activity and counteract the down regulation 
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of the promoter. 

33. The method of any one of claims 30 to 32, wherein 
the nucleic acid binding domain is a GAL 4 domain, and the 

5 HDAC9-GAL4 protein which is capable of interacting with a 
GAL4 nucleic acid binding site of the promoter, so that 
• HDAC9 polypeptide activity causes deacetylation of core 
histones associated with a DNA surrounding GAL 4 binding 
site, thereby causing down regulation of promoter 
10 activity. 

34. A method which comprises, having identified a 
candidate compound by the method of any one of claims 20 
to 33, the step of manufacturing the compound and/or 

15 formulating it in a composition. 

35. An antibody which is capable of specifically binding 
to a histone deacetylase 9 (HDAC9) polypeptide of any one 
of claims 1 to 10. 

20 

36. Use of an antibody of claim 35 in an assay to detect 
and quantify the presence of HDAC9 polypeptide, or an 
isoform thereof. 

25 37. Use of an antibody of claim 35 for purifying s HDAC9 
polypeptide. 

38. Use of an antibody of claim 35 as an inhibitor of a 
HDAC9 polypeptide. 

30 

39. A method for the diagnosis or prognosis of cancer, 
the method comprising determining the presence or amount 
of histone deacetylase 9 (HDAC9) polypeptide of any one 
of claims 1 to 10, or an isoform thereof, or nucleic acid 
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encoding a HDAC9 polypeptide, in a sample from a patient. 

40. The method of claim 39, wherein the cancer is a 
leukaemia . 

5 

41- The method of claim 40 , wherein the leuakaemia is 
TEL-AML1 positive and negative pre-B cell acute 
lymphoblastic leukaemia or B cell lymphoma 

10 42. The method of any one of claims 39 to 41, wherein the 
method comprises: 

(a) determining the presence or amount of HDAC9 
polypeptide in a sample from a patient, by measuring an 
activity of the HDAC9 polypeptide or its presence in a 

15 binding assay; or, 

(b) determining the presence of HDAC9 nucleic acid 
using a probe capable of hybridising to the HDAC9 nucleic 
acid; 

(c) using PGR involving one or more primers based on 
20 a HDAC9 nucleic acid sequence to determine whether the 

HDAC9 transcript is present in a sample from a patient. 

43. The method of any one of claims 39 to 42, wherein the 
method comprises the steps of: 
25 (a) contacting a sample obtained from the patient 

with a solid support having immobilised thereon binding 
agent having binding sites specific for HDAC9 polypeptide 
or HDAC9 nucleic acid; 

(b) contacting the solid support with a labelled 

30 developing agent capable of binding to unoccupied binding 
sites, bound HDAC9 polypeptide or nucleic acid or occupied 
binding sites; and, 

(c) detecting the label of the developing agent 
specifically binding in step (b) to obtain a value 

35 
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representative of the presence or amount of the HDAC9 
polypeptide or nucleic acid in the sample. 

44. Use of an inhibitor of histone deacetylase 9 (HDAC9) 
5 for the preparation of a medicament for the treatment of 

cancer. 

45. The use of claim 44, wherein the cancer is a 
leukaemia. 

10 

46. The use of claim 45, wherein the leukaemia is TEL- 
AML1 positive and negative pre-B cell acute lymphoblastic 
leukaemia or B cell lymphoma. 

15 
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HDAC9 vs HDAC1 
HDAC9 vs HDAC4 
HDAC9 vs HDAC5 
HDAC9 vs HDAC7 
HDAC9 vs HDAC6 



Entire Protein 
Identity (%) Similarity (%) 



26 
56 
57 
43 
17 



45 
70 
71 
54 
28 



Catalytic Domain 
Identity (%) Similarity (%) 



26 
70 
72 
66 

N-term 39 
C-term 44 



45 
83 
84 
77 
56 
60 



Fig. 4B 
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HDAC6 
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HDAC7 



HDAC5 



HDAC9 



Fig. 4C 
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